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Abstract 

Abduction is a fundamental and important form of non- 
monotonic reasoning. Given a knowledge base explaining 
how the world behaves it aims at finding an explanation 
for some observed manifestation. In this paper we focus on 
propositional abduction, where the knowledge base and the 
manifestation are represented by propositional formulae. The 
problem of deciding whether there exists an explanation has 
been shown to be Sj-complete in general. We consider vari- 
ants obtained by restricting the allowed connectives in the 
formulae to certain sets of Boolean functions. We give a com- 
plete classification of the complexity for all considerable sets 
of Boolean functions. In this way, we identify easier cases, 
namely NP-complete and polynomial cases; and we highlight 
sources of intractability. Further, we address the problem of 
counting the explanations and draw a complete picture for the 
counting complexity. 

Introduction 

Abduction is a fundamental and important form of non- 
monotonic reasoning. Assume that given a certain consis- 
tent knowledge about the world, we want to explain some 
observation. This task of finding an explanation or only 
telling if there is one, is called abduction. Today it has 
many application areas spanning medical diagnosis (By- 
lander et al. 1989), text analysis (Hobbs et al. 1993), sys- 
tem diagnosis (Stumptner and Wotawa 2001), configuration 
problems (Amilhastre, Fargier, and Marquis 2002), temporal 
knowledge bases (Bouzid and Ligeza 2000) and has connec- 
tions to default reasoning (Selman and Levesque 1990). 

There are several approaches to formalize the problem of 
abduction. In this paper, we focus on logic based abduction 
in which the knowledge base is given as a set T of proposi- 
tional formulae. We are interested in deciding whether there 
exists an explanation E, i.e., a set of literals consistent with 
r such that r and E together entail the observation. 

From a complexity theoretic viewpoint, the abduction 
problem is very hard in the sense that it is X^-complete 
and thus situated at the second level of the polynomial hi- 
erarchy (Eiter and Gottlob 1995). This intractability result 
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raises the question for restrictions leading to fragments of 
lower complexity. Several such restrictions have been con- 
sidered in previous works. One of the most famous amongst 
those is Schaefer's framework, where formulae are restricted 
to generalized conjunctive normal form with clauses from a 
fixed set of relations (Creignou and Zanuttini 2006; Nordh 
and Zanuttini 2005; 2008). 

A similar yet different procedure is to rather require for- 
mulae to be constructed from a restricted set of Boolean 
functions B. Such formulae are called B-formulae. This ap- 
proach has first been taken by Lewis, who showed that the 
satisfiability problem is NP-complete if and only if this set 
of Boolean functions has the ability to express the negation 
of implication connective /> (Lewis 1979). Since then, this 
approach has been applied to a wide range of problems in- 
cluding equivalence and implication problems (Reith 2003; 
Beyersdorff et al. 2009a), satisfiability and model check- 
ing in modal and temporal logics (Bauland et al. 2006; 
2008), default logic (Beyersdorff et al. 2009b), and circum- 
scription (Thomas 2009), among others. 

We follow this approach and show that Post's lattice al- 
lows to completely classify the complexity of propositional 
abduction for several variants and all possible sets of al- 
lowed Boolean functions. We first examine the case where 
the representation of the manifestation is a literal. We show 
that depending on the set B of allowed connectives the ab- 
duction problem is either Incomplete, or NP-complete, or 
in P and ©LOGSPACE-hard, or in LOGSPACE. More 
precisely, we prove that the complexity of this abduction 
problem is X^-complete as soon as B can express one of 
the functions x V (y A -*z), iA(i/V -iz) or (x A y) V (x A 
-iz) V (y A -iz). It drops to NP-complete when all functions 
in B are monotonic and have the ability to express one of the 
functions x V(y A z), x A (y Vz) or (xAy) V (xAz) V(j/Az). 
The problem becomes solvable in polynomial time and is 
©LOGSPACE-hard if S-formulae may depend on more 
than one variable while being representable as linear equa- 
tions. Finally the complexity drops down to LOGSPACE in 
all remaining cases. 

We then examine several variants of the propositional ab- 
duction problem. The variants considered are obtained by 
restricting representation of the manifestation to be respec- 
tively a clause, a term or a B-formula. We present a complete 
classification in all cases. An overview of the results is given 



in Figure 1 . Our results highlight the sources of intractabil- 
ity and exhibit properties of Boolean functions that lead to 
an increase of the complexity of abduction. 

In (Creignou and Zanuttini 2006) the authors obtained a 
complexity classification of the abduction problem for for- 
mulae which are in generalized conjunctive normal form, 
with clauses from a fixed set of relations. The two classi- 
fications are in the same vein since they classify the com- 
plexity of abduction for local restrictions on the knowledge 
base. However the two results are incomparable, in the sense 
that no classification can be deduced from the other. They 
only overlap on the particular case of the linear connective 
©, for which both types of sets of formulae can be seen as 
systems of linear equations. This special abduction case has 
been shown to be decidable in polynomial time in (Zanuttini 
2003). 

Besides the decision problem, another natural question 
is concerned with the number of explanations. This prob- 
lem refers to the counting problem for abduction. The 
study of the counting complexity of abduction has been 
started by Hermann and Pichler (2007). We prove here a 
trichotomy theorem showing that counting the full explana- 
tions of propositional abduction problems is either #-coNP- 
complete or ^P-complete or in FP, depending on the set B 
of allowed connectives. 

The rest of the paper is structured as follows. We first 
give the necessary preliminaries. Afterwards, we define the 
abduction problem considered herein. We then classify the 
complexity of the abduction of a single literal. These results 
are complemented with the complexity of the abduction 
problem for clauses, terms and restricted formulae. Next, we 
consider the counting problem and finally conclude with a 
discussion of the results. 

Preliminaries 

Complexity Theory We require standard notions of com- 
plexity theory. For the decision problems the arising com- 
plexity degrees encompass the classes LOGSPACE, P, NP, 
and Y%. For more background information, the reader is re- 
ferred to (Papadimitriou 1994). We furthermore require the 
class ©LOGSPACE defined as the class of languages L 
such that there exists a nondeterministic logspace Turing 
machine that exhibits an odd number of accepting paths if 
and only if x G L, for all x (Buntrock et al. 1992). It holds 
that LOGSPACE C ©LOGSPACE C P. For our hardness 
results we consider logspace many-one reductions, defined 
as follows: a language A is logspace many-one reducible 
to some language B (written A <j£ g B) if there exists a 
logspace-computable function f such that x G A if and only 
if f(x)€B. 

A counting problem is represented using a witness func- 
tion w, which for every input x returns a finite set of wit- 
nesses. This witness function gives rise to the following 
counting problem: given an instance x, find the cardinal- 
ity |iu(x)| of the witness set w(x). The class #P is the 
class of counting problems naturally associated with de- 
cision problems in NP. According to (Hemaspaandra and 
Vollmer 1995) if C is a complexity class of decision prob- 
lems, we define #-C to be the class of all counting problems 



whose witness function is such that the size of every witness 
y of x is polynomially bounded in the size of x, and check- 
ing whether y G w(x) is in C. Thus, we have #P = #-P 
and #P C #-coNP. Completeness of counting problems is 
usually proved by means of Turing reductions. A stronger 
notion is the parsimonious reduction where the exact num- 
ber of solutions is conserved by the reduction function. 

Propositional formulae We assume familiarity with propo- 
sitional logic. The set of all propositional formulae is de- 
noted by C. A model for a formula tp is a truth assignment to 
the set of its variables that satisfies tp. Further we denote by 
tp[a/f3] the formula obtaine from ip by replacing all occur- 
rences of a with f3. For a given set V of formulae, we write 
Vars(r) to denote the set of variables occurring in T. We 
identify finite T with the conjunction of all the formulae in 
r, A^er f- F° r an y formula tp G C, we write T \= ip if T 
entails ip, i.e., if every model of V also satisfies ip. 

A literal I is a variable x or its negation -ix; x is called the 
atom of I and is denoted by |/|. Given a set of variables V, 
Lits(y) denotes the set of all literals formed upon the vari- 
ables in V, i.e., LitsfV) = V U {->x | x G V}. A clause is a 
disjunction of literals and a term is a conjunction of literals. 

Clones of Boolean Functions A clone is a set of Boolean 
functions that is closed under superposition, i.e., it contains 
all projections (that is, the functions f(a\, . . . ,a n ) = a,k 
for 1 < k < n and n G N) and is closed under arbitrary 
composition. Let B be a finite set of Boolean functions. We 
denote by [B] the smallest clone containing B and call B a 
base for [B]. All closed classes of Boolean functions were 
identified by Post (1941). Post also found a finite base for 
each of them and detected their inclusion structure, hence 
the name of Post's lattice (see Figure 1). 

In order to define the clones, we require the following no- 
tions, where / is an n-ary Boolean function: 

- / is c-reproducing if /(c, . . . , c) = c, c € {0, 1}. 

- / is monotonic if ai <b\,a2 < b 2 , ■ ■ ■ , a n < b n implies 
f(ai, . . . ,a n ) < f(h, . . .,b n ). 

- f is c-separating of degree k if for all A G f (c) of 
size \A\ = k there exists an i G {1, . . . , n} such that 
(oi, . . . , a n ) G A implies flj = c, c e {0, 1}. 

- / is c-separating if / is c-separating of degree |/ _1 (c)|. 

- / is self-dual if / = -i/(-iXi, . . . , ->x„). 

- / is affine if / = X\ © • ■ • © x n © c with c G {0, 1}. 

A list of all clones with definitions and finite bases is given 
in Table 1 on page 9, see also e.g., (Bohler et al. 2003). A 
propositional formula using only functions from B as con- 
nectives is called a B-formula. The set of all i?-formulae is 
denoted by C(B). Let / be an n-ary Boolean function. A B- 
formula <ps such that Vars(<,s) = {xi, . . . , x n , y\, . . . , y m } is 
a B- representation of / if for all a\, . . . , a n , b\, . . . , b m G 
{0, 1} it holds that f(a\, . . . , a n ) = 1 if and only if every 
a: Vars(<p) — > {0, 1} with <r(xi) = ai and cr(yi) = bi for 
all relevant i, satisfies <p>. Such a ^-representation exists for 
every / G [B]. Yet, it may happen that the ^-representation 
of some function uses some input variable more than once. 



Example 1. Let h(x, y) = xA^y. An { /^-representation of 
the function x A y is h(x, h(x, y)). 

Observe that if B\ and B 2 are two sets of Boolean func- 
tions such that B\ C [B 2 ], then every function of B\ can be 
expressed by a £? 2 -formula, its so-called B2-representation. 

The Abduction Problem 

Let B be a finite set of Boolean functions. We are interested 
in a propositional abduction problem parameterized by the 
set B of allowed connectives. We define the abduction prob- 
lem for B-formulae as 

Problem: Abd(B) 

Instance: V = (T, A, <p), where 

- T is a set of B-formulae, T C £(B), 

- A is a set of variables, A C Vars(r), 

- ip is a formula, ip e £ with Vars(<^) C Vars(r) \ A. 
Question: Is there a set C Lits(A) such that r A E is 

satisfiable and T AE |= ip (or equivalently r A E A -up 
is unsatisfiable)? 

The set T represents the knowledge base. The set A is called 
the set of hypotheses and <p is called manifestation or query. 
Furthermore, if such a set E exists, it is called an explana- 
tion or a solution of the abduction problem. It is called a full 
explanation if Vars(_E) = A. Observe that every explana- 
tion can be extended to a full one. 

We will consider several restrictions on the manifesta- 
tions of this problem. To indicate these restrictions, we in- 
troduce a second argument Ai: in the abduction problem 
Abd(_B, M), ip is required to be a single literal if M = Q, 
a clause if M = C, a term if M = T, and a B-formula if 
M = £(B). 

Let us start with a lemma that makes clear the role of the 
two constants and 1 in our problem. 

Lemma 2. Let B be a finite set of Boolean functions 

1. IfM G {Q, C, T, C{B)}, then 

Abd(B, M) =|S s Abd(B U {1}, M) 

2. IfM G {Q, C, T} and V G [B], then 

ABD(B, M) =lS s Abd(B U {0}, M) 

Proof. To reduce Abd(B U {1},M) to Abd{B,M) we 
transform any instance of the first problem in replacing ev- 
ery occurrence of 1 by a fresh variable t and adding the 
unit clause (t) to the knowledge base. To prove ABD(i3 U 
{0},M) = l °z Abd(B,M), let V = (T,A,ip) be an in- 
stance of the first problem and / be a fresh variable. If 
M. G {Q, C, T}, then we can suppose w.l.o.g. that ip does 
not contain 0. We map V to V = (V, A U {/}, ip), where 
T' is the S-representation of {ip[0/f] V / | <p G V}. 

The Complexity of Abd(S, Q) 

Theorem 3. Let B be a finite set of Boolean functions. 
Then, the abduction problem for propositional B-formulae, 
Abd(_B, Q), is 



1. T^-complete ifS a2 Q [B] or S 12 C [B] or D 1 C [B], 

2. NP-complete ifS 00 Q [B] C M or S w Q [B] C M or 
D 2 C [B] C M, 

3. in P and (BLOGSP ACE-hard if L 2 C [_B] C L, and 

4. in LOGSPACE in a/Z ofwer caiei. 

Remark 4. For such a classification a natural question is: 
given B, how hard is it to determine the complexity of 
Abd(_B,Q)? Solving this task requires checking whether 
certain clones are included in [B] (for lower bounds) and 
whether B itself is included in certain clones (for upper 
bounds). As shown in (Vollmer 2009), the complexity of 
checking whether certain Boolean functions are included in 
a clone depends on the representation of the Boolean func- 
tions. If all functions are given by their truth table then the 
problem is in quasi-polynomial-size AC , while if the input 
functions are given in a compact way, i.e., by circuits, then 
the above problem becomes coNP-complete. 

We split the proof of Theorem 3 into several propositions. 

Proposition 5. Let B be a finite set of Boolean functions 
such that [B] C E or [B] C N or [B] C V. Then 
Abd(B, Q) g LOGSPACE. 

Proof. Let V = (T, A, q) be an instance of Abd(B, Q). 

For [B] = N or E, r is equivalent to a set of literals, hence 
V has the empty set as a solution if V possesses a solution 
at all. Finally notice that satisfiability of a set of N-formulae 
can be tested in logarithmic space (Schnoor 2005). 

For [B] — V each formula ip G T is equivalent to either a 
constant or disjunction. It holds that (r, A, q) has a solution 
if and only if T contains a formula ip = q V x\ V • • • V Xk 
such that X := {x\, ...n}Ci, and T[A/0] is satisfiable. 
This can be tested in logarithmic space, as substitution of 
symbols and evaluation of V-formulae can all be performed 
in logarithmic space. 

Proposition 6. Let B be a finite set of Boolean func- 
tions such that L 2 C [B] C L. Then Abd(B, Q) is 
©LOGSPACE-rtara! and contained in P. 

Proof. In this case, deciding whether an instance of 
Abd(_B, Q) has a solution logspace reduces to the problem 
of deciding whether a propositional abduction problem in 
which the knowledge base is a set of linear equations has a 
solution. This has been shown to be decidable in polynomial 
time in (Zanuttini 2003). 

As for the ©LOGSPACE-hardness, let B be such that 
[B] = L 2 . Consider the ©LOGSPACE-complete prob- 
lem to determine whether a system of linear equations S 
over GF{2) has a solution (Buntrock et al. 1992). Note 
that ©LOGSPACE is closed under complement, so de- 
ciding whether such a system has no solution is also 
©LOGSPACE-complete. Let 5" = {si, . . . , s m } be such 
a system of linear equations over variables {xi, . . . , x n }. 
Then, for all 1 < i < m, the equation Sj is of the form 
x il + ••• + Xi n . = Ci (mod 2) with a G {0,1} and 
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Fig. 1. Post's lattice showing the complexity of the abduction problem Abd(B,M) for all sets B of Boolean functions and considered 
restrictions M of the manifestations. 



ii,...,i ni S {1, . . . , n}. We map S to a set of affine formu- 
lae T = {ipi, . . . , ip m } over variables {xi, . . . , x n , q} via 

ipi := Xi x © • ■ • © © 1 if Cj = 0, and 

^ii © * ' * © ^i n . if c i = 1- 

Now define 

T' := {ifi ® q \ ipi e T such that ipi(l, . . . , 1) = 0} 

U {<Pi 6 r such that ^(1, . . . , 1) = 1}. 

r' is obviously satisfied by the assignment mapping all 
propositions to 1. It furthermore holds that S has no solution 
if and only if V A is unsatisfiable. Hence, we obtain that 
S has no solution if and only if the propositional abduction 
problem (T', 0, q) has an explanation. 

It remains to transform r" into a set of B-formulae in log- 
arithmic space. Since [B] = L 2 , we have x © y © z € [B]. 
We insert parentheses in every formula ip of V in such a 
way that we get a ternary ©-tree of logarithmic depth whose 
leaves are either a proposition or the constant 1. Then we 



replace every node © by its equivalent _B-formula. Thus we 
get a (B U {l})-formula of size polynomial in the size of the 
original one. Lemma 2 allows to conclude. 

Note that the £>-formulae replacing the connectives might 
use some input variable more than once. Therefore, the log- 
arithmic depth tree is built in order to avoid an exponential 
explosion of the formula size during the replacement. 

Observe that the abduction problem for B-formulae is 
self -reducible for the above cases, i.e., for [B] C L, [B] C E 
and [B] C V. Roughly speaking this means, given an in- 
stance V and a literal I, we can compute efficiently an in- 
stance V' such that the question whether there exists an ex- 
planation E with I G E reduces to the question whether 
V' admits solutions. It is well-known that for self-reducible 
problems whose decision problem is in P, the lexicographi- 
cally first solution can be computed in FP. It is an easy ex- 
ercise to extend this algorithm to enumerate all solutions in 
lexicographical order with polynomial delay and polynomial 



space. Thus, the explanations of Abd(_B, Q) can be enumer- 
ated with polynomial delay and polynomial space if [B] C L 
or [B] C E or [B] C V, according to Proposition 5 and 6. 

Proposition 7. Let B be a finite set of Boolean functions 
such that S o Q [B] C M or Sio C [5] C M or D 2 C 
[B] C M. T/zen Abd( j B, Q) is NP-complete. 

Proof. We first show that Abd(£?,Q) is efficiently verifi- 
able. Let P = (T,A,q) be an Abd(_B, Q)-instance and 
E C Lits(A) be a candidate for an explanation. Define V 
as the set of formulae obtained from T by replacing each oc- 
currence of the proposition x with if -ix E E, and each 
occurrence of the proposition x with 1 if x E E. It holds 
that £ is a solution for V if T' is satisfiable and T'[q/0] 
is not. These tests can be performed in polynomial time, 
because V is a set of monotonic formulae (Lewis 1979). 
Hence, Abd(B, Q) e NP. 

Next we give a reduction from the NP -complete problem 
2-IN-3-SAT, i.e., the problem to decide whether there exists 
an assignment that satisfies exactly two propositions in each 
clause of a given formula in conjunctive normal form with 
exactly three positive propositions per clause, see (Schaefer 
1978). Let tp := f\ ieI c t with c t = x a V x i2 V x i3 , i E /, 
be the given formula. We map tp to the following instance 

V = (T,A, q). Let q, qi, i E /, be fresh, pairwise distinct 
propositions and let A := Vars(<p) U {qi \ i E I}. We define 
Tas 

T:={c t \iEl} (1) 
U {xnVXi2\/qi,Xii\/Xi3Vqi,Xi2VXia\/qi\i E 1} (2) 

U {V ie / A?=i Xii V V ie / ft V q} (3) 

We show that there is an assignment that sets to true ex- 
actly two propositions in each clause of tp if and only if 

V has a solution. First, suppose that there exists an assign- 
ment a such that for all i E I, there is a permutation iTi 
of {1,2,3} such that a(x iw .m) = and a{x iv ./2)) = 
a(x in . (3)) = 1. Thus (1) and (2) are satisfied, and (3) is 
equivalent to \J ieI qi V q. From this, it is readily observed 
that {->x | a(x) = 0} U {-^ i E 1} is a solution to V. 

Conversely, suppose that V has an explanation E that is 
w.l.o.g. full. Then r A E is satisfiable and Y A E \= q. Let 
a : Vars(r) — > {0, 1} be an assignment that satisfies T A E. 
Then, for any x E A, a(x) = if ->x E E, and cr(x) = 1 
otherwise. Since TAE entails q and as the only occurrence of 
q is in (3), we obtain that a sets to each % and at least one 
proposition in each clause of tp. Consequently, from (2) fol- 
lows that a sets to 1 at least two propositions in each clause 
of tp. Therefore, a sets to 1 exactly two propositions in each 
clause of tp. 

It remains to show that V can be transformed into an 
Abd(_B, Q)-instance for all considered B. Observe that 

V £ [B U {1}] and [S 00 U {0,1}] = [D 2 U {0,1}] = 
[Sio U {0, 1}] = M. Therefore due to Lemma 2 it suffices 
to consider the case [B] = M. Using the associativity of V 
rewrite (3) as an V-tree of logarithmic depth and replace all 
the connectives in T by their B-representation (V, A G [B]). 



Proposition 8. Let B be a finite set of Boolean functions 
such that S02 Q [B] or S12 C [B] or Di C [B\. Then 
Abd(_B, Q) is ^-complete. 

Proof. Membership in is easily seen to hold: given an 
instance (T, A, q), guess an explanation E and subsequently 
verify that r A E is satisfiable and r A E A ->q is not. 

Observe that V E [BU{1}]. By virtue of Lemma 2 and the 
fact that [S 02 U{0, 1}] = [Si 2 U{0, 1}] = [DiU{0, 1}] = BF, 
it suffices to consider the case [B] = BF. In (Eiter and Gott- 
lob 1995) it has been shown that the propositional abduction 
problem remains £2" com pl ete when the knowledge base T 
is a set of CNF-formulae. From such an instance (F, A, q) 
we build an instance of Abd(_B, Q) by rewriting first each 
formula as a tree of logarithmic depth and then replacing all 
the connectives A,V and -1 by their ^-representation, thus 
concluding the proof. 

Complexity of the Variants 

We now turn to the study of the complexity of some 
variants of the abduction problem. It is obvious that 
Abd(B,Q) <£s Abd(B,C) and that Abd(B,Q) <\°s 
Abd(B, T). Therefore, all hardness results still hold for the 
variants Abd(I?, C) and Abd(£?,T). Also, it can be eas- 
ily checked that the hardness results in the previous sections 
still hold when the query is required to be a positive literal. 
For this reason the hardness results also carry over to the 
variant Abd(B, £(£)). 

It is an easy exercise to prove that all algorithms that have 
been developed for a single query can be naturally extended 
to clauses. Therefore, the complexity classification for the 
problem Abd(B, C) is exactly the same as for Abd(B, Q). 

Theorem 9. Let B be a finite set of Boolean functions. 
Then, the abduction problem for propositional B-formulae, 
ABD(£, C), is 

1. ^-complete if S 02 C [B] or S 12 Q [B] or D x C [B], 

2. NP-complete if S 00 C [B] C M or S w C [B] C M or 
D 2 C [B] C M, 

3. in P and ®LOGSP ACE-hard if \- 2 C [B] C L, and 

4. in LOGSPACE in all other cases. 

More interestingly, we will prove in the next section that 
allowing terms as manifestations increases the complexity 
for the clones V (from membership in LOGSPACE to NP- 
completeness), while allowing B-formulae as manifesta- 
tions makes the classification dichotomous, P '/X^-complete, 
thus skipping the intermediate NP level. 

The Complexity of Abd(5, T) 

Proposition 10. Let B be a finite set of Boolean functions 
such that V 2 C [B] C V. Then Abd(B, T) is NP '-complete. 

Proof. Let B be a finite set of Boolean functions such that 
V 2 C [B] C V and let V = (T, A, t) be an instance of 
Abd(_B, T). Hence, T is a set of .B-formulae and t is a term, 
t = A" = i k- Observe that E is a solution for V if T A E is 
satisfiable and for every i = 1, . . . , n, T A E A is not. 
Given a set E C Lits(A), these verifications, which require 



substitution of symbols and evaluation of an V-formula, can 
be performed in polynomial time, thus proving membership 
inNP. 

To prove NP-hardness, we give a reduction from 3SAT. 
Let tp be a 3-CNF-formula, ip := /\»ei °i- Let x\,...,x n 
enumerate the variables occurring in ip. Let x[ , . . . , x' n and 
qi, . . . , q n be fresh, pairwise distinct variables. We map <p to 
V = (T,A,t), where 

r ■.= {a[->x 1 /x' 1 ,...,->x n /x' n ] \ i e 1} 

U {xi V V qi,x'i V % I 1 < i < n}, 
A . {x\, ■ • ■ , x n , x-y, . . . , x n }, 
t :=qi A • • • Aq n . 

We show that tp is satisfiable if and only if V has a so- 
lution. First assume that p is satisfied by the assignment 
a: {xi, . . .,x n } ->■ {0, 1}. Define E := {-.a;* | cr(xj) = 
0} U {-ix'i | cr(a;i) = 1} and a as the extension of a map- 
ping &(x'A = -^a(xi) and <r(gi) = 1 for all 1 < i < n. 
Obviously, a \= V A E. Furthermore, T A E \= qi for all 
1 < z < n, because any satisfying assignment of T A E sets 
to either Xi or x\ and thus {a^ V qi, x\ V |= ft. Hence 
E is an explanation for V. 

Conversely, suppose that V has a full explanation E. The 
facts that T A E \= qi A ■ ■ ■ A q n and that each qi occurs 
only in the clauses Xi V qi , x\ V enforce that, for every i, E 
contains -iXi or Because of the clause XiM x\, it cannot 
contain both. Therefore in E the value of x\ is determined 
by the value of Xi and is its dual. From this it is easy to 
conclude that the assignment a : {x\, . . . , x n } — > {0, 1} de- 
fined by a(xi) = if -'Xi G E, and 1 otherwise, satisfies p. 
Finally V can be transformed into an Abd(_B, T)-instance, 
because every formula in T is the disjunction of at most three 
variables and V G [B]. 

Theorem 11. Let B be a finite set of Boolean functions. 
Then, the abduction problem for propositional B-formulae, 
Abd(B, T), is 

1. ^-complete ifS 02 Q [B] or S 12 C [B] or Di C [B], 

2. NP-complete if\J 2 C [B] C M or S w C [B] C M or 
D 2 C [B] C M, 

3. in P andeLOGSPACE-narciif L 2 C [£?] C L, anci 

4. in LOGSPACE in all other cases. 

The Complexity of Abd(B, C{B)) 
Proposition 12. Let B be a finite set of Boolean functions 
such that Soo Q [B] or Sio Q [B] or D 2 C [B]. Then 
Abd(B, £(£?)) is ^-complete. 

Proof. We prove X^-hardness by giving a reduction from 
the E^-hard problem QSAT 2 (Wrathall 1977). Let an in- 
stance of QSAT 2 be given by a closed formula \ '■= 
3xi ■ ■ ■ 3x n Wyi ■ ■ ■ My m <p with p a 3-DNF-formula. First 
observe that Etei • • • 3x„Vyi • • • Vy m p is true if and only if 
there exists a consistent set X C Lits({xi, . . . , x n }) such 
that X fl {xi, — io?i} 7^ 0, for all 1 < i < n, and -iX V p is 
(universally) valid (or equivalently ^Alis unsatisfiable). 

Denote by Tp the negation normal form of -up and let Tp 1 
be obtained from Tp by replacing all occurrences of -iXi 



with a fresh proposition x\, 1 < i < n, and all occur- 
rences of -ijji with a fresh proposition y[, 1 < i < m. That 

is, Tp' J= Tp[-n Xl /x[, ->X n /x' n , — '2/l/2/i, • • • , --J/m/j/m]- 

Thus = Aiei c i where every c- is a disjunction of three 
propositions. To \ we associate the propositional abduction 
problem V = (T, A, ip) defined as follows: 

r := {d t V q I i G /} 

^{xiVx'i\l<i<n}Ll{yiVy'i\l<i<m} 
U {fi Vxi,ti Vx'i,fi Vt t \ l<i<n}, 

A:={ti,fi I 1 <i<n}, 

i> ■= 1 V Vi<i<„(a* A ^) V Vi<i< m (y» A y0- 

Suppose that x is true. Then there exists an assign- 
ment a: {xi, . . . ,x n } — > {0,1} such that no extension 
a': {xi, . . .,x n } U {yi, . . .,y m } ->• {0, 1} of cr satisfies 
-up. Define X as the set of literals over {xi ,x n } set to 
lbycr. Defining E := {^.fi,U \ x l G A}U{^ii,/i | -i^j G 
X}, we obtain with abuse of notation 

= A« e / 4 a Ai<i< n te © ^) a Ai<i< ro (yi © yDA 

Al<i<n, ( r(x i )=l X i A /\l<i< n! a( Xi )=o x 'i 
= -up AX, 

which is unsatisfiable by assumption. As T A£ is satisfied 
by any assignment setting in addition all Xi, x\, 1 < i < n, 
and all yj,y'j, 1 < i < m, to 1, we have proved that E is an 
explanation for V. 

Conversely, suppose that V has an explanation E. Due 
to the clause (/j V U) in T, we also may assume that \E n 
{-■<», "•/ill < 1 for all 1 < i < n. 

Setting X := {xi \ -ifo G E} U {— iar» | -tfj G we now 
obtain Ai< 4 <„ ({fi V Si) A (U V -x 4 ) A (/< Vi»)) A£ = X 
and r A S A -t0 = -195 A A as above. Hence, -k^ A X 
is unsatisfiable, which implies the existence of an assign- 
ment a: {x\, . . . ,x n } — > {0,1} such that no extension 
a': {xi, . . . , x n } U {yi, y m } ->■ {0, 1} of o satisfies 
-up. Therefore, we have proved that \ i s tme if and only if 
V has an explanation. 

It remains to show that V can be transformed into an 
Abd(£>, £(£?)) -instance for any relevant B. Since [Soo U 
{1}] = S01, [Sio U {1}] = Mi, [D 2 U {1}] = Sg,. and 
S01 C Sqi C Mi, by Lemma 2 it suffices to consider the 
case [B] — Soi- Observe that V, xV(yAz) G [B]. The trans- 
formation can be done in polynomial time by replacement, 
rewriting ip as Vi<*<„ 1 V (a* A sj) V Vi< 4 < m 9 V (y, A yj) 
and using the associativity of V. 

Theorem 13. Let B be a finite set of Boolean func- 
tions. Then, the abduction problem for propositional B- 
formulae,ABD(B, C(B)), is 

1. YS^-complete //"Soo Q [B] or Sio Q [B] or D 2 C [B], 

2. in P and ®LOGSP ACE-hard if L 2 C [B] C L, W 
5. in LOGSPACE ;'n all other cases. 



Counting complexity 

The counting problem #Abd(B, Q) we are interested in 
is the following: given an instance V = (T, A, <p) of 
Abd(B, Q), compute #Sol(7 7 ), the number of full expla- 
nations of V. 

Theorem 14. Let B be a finite set of Boolean functions. 
Then, the abduction problem for propositional B-formulae, 
#Abd(B,Q), is 

1. #-coNP '-complete if 'S 02 Q [B] or S 12 C [B] or D 1 C [B], 

2. #P -complete ifV 2 C [B] C M or Si C [B] C M or 
D 2 C [5] C M, 

3. ;n FP ;n a/Z of/zer cases. 

Proof. The #-coNP-membership for #Abd(B, Q) follows 
from the fact that checking whether a set of literals is indeed 
an explanation for an abduction problem is in P NP = A£ 
and from the equality #-A^ = #-coNP, see (Hemaspaan- 
dra and Vollmer 1995). 

We show the #-coNP-hardness by giving a parsi- 
monious reduction from the following #-coNP-complete 
problem: Count the number of satisfying assignments of 
ip(x!,...,x n ) = Vyi---Vy m <p(xi,...,x n ,yi,...,y m ), 
where ip is a DNF-formula (see, e.g., (Durand, Hermann, 
and Kolaitis 2005)). Let x'i, . . . , x' n , Vi, . . . , v n ,t, and q be 
fresh, pairwise distinct propositions. We define the proposi- 
tional abduction problem V = (T, A, q) as follows: 

T := {xi — > Ti,x\ — > r i7 ^Xi V -ix'i 1 < i < n) 

u{^t}u{Ar = i^At^<z}, 

A := {x!, ...,x n }U {xi, . . .,x' n }. 

Observe that the manifestation q occurs only in the formula 
AiLi r i ^ t Q- This together with the formulae Xi — > 
Ti, x\ — > Ti, -<Xi V -ix'i, 1 < i < n, enforces that every full 
explanation of V has to select for each i either Xi and -^x\, 
or -lajj and x\. By this the value of x\ is fully determined by 
the value of Xi and is its dual. Moreover, it is easy to see that 
there is a one-to-one correspondence between the models of 
ip and the full explanations of V . 

Observe that since the reductions in Lemma 2 are parsi- 
monious, we can suppose w.l.o.g. that B contains the two 
constants 1 and 0. Therefore, analogously to Proposition 8 it 
suffices to consider the case [B] = BF. Since T can be writ- 
ten in a normal form, it can be transformed in logarithmic 
space into an equivalent set of B-formulas. This provides 
a parsimonious reduction from the #-coNP-complete prob- 
lem we considered to #Abd(_B, Q). 

Note that the proposed reduction is very similar to the 
one proposed in (Hermann and Pichler 2007). It also proves 
hardness of counting the positive explanations (since there is 
an one-to-one correspondence between full explanations and 
purely positive explanations), as well as hardness of count- 
ing the subset-minimal explanations, (since all solutions are 
incomparable and hence subset-minimal). 

Let us now consider the #P-complete cases. When [B] C 
M, checking whether a set of literals E is an explanation for 



an abduction problem with £>-formulae is in P (see Propo- 
sition 7). This proves membership in #P. For the hard- 
ness result, it suffices to consider the case [B] = V 2 , be- 
cause the reduction provided in Lemma 2 is parsimonious 
and V 2 C [Sio U {1}]. We provide a Turing reduction 
from the problem #Positive-2-Sat, which is known to be 

#P-complete (Valiant 1979). Let tp = Ai=i(P» v u) be 
an instance of this problem, where pi and qi are proposi- 
tional variables from the set X — {x\, . . . , x n }. Let q be a 
fresh proposition. Define the propositional abduction prob- 
lem V — (T, A, q) as follows: 

r := { Pi Vq t V? | 1 < i < k}, A:={xi,...,x n }. 

It is easy to check that the number of satisfying assignments 
for (p is equal to 2™ - #Sol(P). Finally, since V G [B] = V 2 , 
V can easily be transformed in logarithmic space into an 
Abd(B, Q)-instance. 

As for the tractable cases, the clones E and N are easy; 
and finally, for [B] C L, the number of full explanations 
is polynomial time computable according to (Hermann and 
Pichler 2007, Theorem 8). 

Concluding Remarks 

In this paper we provided a complete classification of the 
propositional abduction problem, Abd(B, M) for every set 
B of allowed connectives. We gave results for several re- 
strictions over the representation of manifestations. For in- 
stance our results show that when the knowledge base for- 
mulae are positive clauses (clone V) then the abduction 
problem is very easy (solvable in LOGSPACE), even when 
the manifestations are also represented by positive clauses. 
But its complexity jumps to NP-completeness when man- 
ifestations are represented by positive terms. When look- 
ing at the so-called monotonic fragment (clone M) the ab- 
duction problem is NP-complete when manifestations are 
represented by clauses or terms. Allowing also the manifes- 
tation to be a monotonic formulae as the knowledge base, 
it becomes Ej-complete. This can be intuitively explained 
as follows. The complexity of the abduction rests on two 
sources: finding a candidate explanation and checking that 
it is indeed a solution. The NP-complete cases that occur in 
our classification hold for problems for which verification 
can be performed in polynomial time. If both the knowl- 
edge base and the manifestation are represented by mono- 
tonic formulae, verifying a candidate explanation is coNP- 
complete. Further our results also show that, except for the 
clones L and L c with c G {0, 1, 2, 3}, all tractable cases are 
even trivial. 

In order to get a more detailed picture of the complex- 
ity of abduction in this framework it would now be inter- 
esting to consider restrictions on hypotheses. A classifica- 
tion in which both types of restrictions are considered would 
help to understand how restrictions on the representation of 
hypotheses and manifestations influence the complexity of 
the propositional abduction problem. When restricting the 
hypotheses, as for instance when requiring the explanation 
to consist of positive literals only, it may happen that only 
one maximal candidate has to be considered, thus leading 



to further trivial but also some coNP-complete cases. Also 
note that the upper bounds in the case of the clone L relies 
on Gaussian elimination (Zanuttini 2003). This method fails 
when restricting the hypothesis to be positive. Determining 
the complexity of abduction for the clone L when manifesta- 
tions are represented by L-formulae and explanations have to 
be positive might hence prove to be a challenging task (no- 
tice that this case remained unclassified in circumscriptive 
inference (Thomas 2009)). 
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Table 1. The list of all Boolean clones with definitions and bases, where h n := VI=i A^=i j^ti x i an d dual(/)(oi, . . . , a„) = 
-i/(-iOl . . . , -io„). 



